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ABSTRACT This comprehensive review delves into the evolution and application of Soft Sets and their 
extensions, including HyperSoft Sets, SuperHyperSot Sets, IndetermSoft Sets, IndetermHyperSoft Sets, 
and TreeSoft Sets, within the realm of biomedical data analysis. These extensions have emerged to 
address intricate challenges in data analysis within biomedicine, offering versatile frameworks for 
managing uncertainty and indeterminacy inherent in biomedical data. Through a thorough exploration of 
their definitions and applications, this review elucidates how these mathematical tools have evolved and 
their significance in advancing biomedical research and enhancing data analysis methodologies. Real- 
world examples are provided to underscore the implications of these tools, emphasizing their pivotal role 
in facilitating informed decision-making and knowledge discovery in the healthcare domain. 


INDEX TERMS Soft Sets; Healthcare Claims Data; Decision-Making 


l. INTRODUCTION 

In the dynamic arena of biomedical research 
[Gifu et al., 2019], where the complexities of data 
often rival the intricacies of the biological 
systems under scrutiny, the ability to model and 
analyze such multifaceted datasets stands as a 
linchpin for progress. Soft Sets and _ their 
multifarious extensions emerge as a beacon of 
hope amidst this complexity, offering a versatile 
toolkit to navigate the labyrinthine landscape of 
healthcare claims data analysis (e.g., treatments 
given, providers used, billed amounts, 
prescriptions filled) [Thesmar et al., 2019]. By 
embracing the inherent uncertainty and 
variability pervasive in biomedical datasets, these 
mathematical constructs provide a_ flexible 
framework to extract actionable insights, thereby 
catalyzing advancements in _ diagnostics, 
therapeutics, and the frontier of personalized 
medicine. 

The genesis of soft set theory by Molodtsov in 
1999 [Molodtsov, 1999] marked a watershed 


moment in grappling with uncertainty within 
data. The emphasis on uncertainty stems from its 
omnipresence in modern databases, particularly 
accentuated in the intricate tapestry of biomedical 
data [Smarandache ef al., 2015]|[Gifu, 2022]. 
Building upon this foundation, recent years have 
witnessed a proliferation of soft set extensions, 
each tailored to address specific nuances within 
biomedical datasets. The pioneering work of 
Smarandache in 2018 [Smarandache, 2018] 
introduced HyperSoft Sets, followed by the 
advent of SuperHyperSoft Sets, IndetermSoft 
Sets, IndetermHyperSoft Sets, and TreeSoft Sets 
in 2022 [Smarandache, 2019][Smarandache et 
al., 2022a] [Smarandache, 2015] [Smarandache, 
2014], [Smarandache, et al., 2022b]. These 
extensions, coupled with the contributions of 
Alkhazaleh and his team with the MultiSoft Set 
in 2010, have significantly enriched the repertoire 
of mathematical tools for data analysis within the 
biomedical domain [Alkhazaleh et al., 2010]. 
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Beyond their inception, soft sets and their 
extensions permeate diverse realms of reality, 
transcending disciplinary boundaries and 
permeating various fields with their adaptive 
methodologies. Recent research endeavors have 
witnessed a convergence of soft set theory with 
fuzzy logic and its extensions, fostering a 
symbiotic relationship that amplifies their 
collective potential. Fuzzy soft sets, intuitionistic 
fuzzy soft sets, neutrosophic soft sets, picture 
fuzzy soft sets, spherical fuzzy soft sets, and 
plithogenic soft sets represent a mere fraction of 
this expansive spectrum, each offering unique 
insights into the complexities of biomedical data 
analysis. 

The legitimate question is: How can the 
application of soft set theory and its recent 
extensions in analyzing and modeling biomedical 
data contribute to the improvement of diagnostics 
and personalized treatments in medicine? 

Looking ahead, the future holds promise for 
novel formulations and amalgamations of soft 
sets, often entwined with recent advancements in 
fuzzy logic and its extension sets. Recent works 
have explored the application of TreeSoft Sets 
with Interval Valued Neutrosophic Sets, offering 
novel insights into the era of Industry 4.0 and its 
implications for data analysis [Alqazzaz and 
Sallam, 2024]. Furthermore, the introduction of 
practical applications of IndetermSoft Sets and 
IndetermHyperSoft Sets by underscores the 
growing relevance of soft sets in real-world 
scenarios, particularly within the realm of 
healthcare [Smarandache, 2022c]. 

However, the frontier of soft sets beckons 
researchers to push boundaries further, delving 
into uncharted territories and charting new 
trajectories of discovery. Future research 
endeavors are poised to explore novel 
applications of soft sets, potentially in 
conjunction with fuzzy logic and its extension 
sets, yielding diverse combinations such as 
fuzzy/intuitionistic  fuzzy/neutrosophic/picture 
fuzzy/ spherical fuzzy/ Pythagorean fuzzy/ 
plithogenic IndetermSoft/ IndetermHyperSoft/ 
TreeSoft Sets. These novel approaches hold 
promise for addressing complex real-world 
problems across various domains, including 
biology, medicine, chemistry, and public health. 

Recent works have explored the selection of 
the best process for desalination under a TreeSoft 
Set environment, highlighting the versatility of 
soft sets in addressing diverse challenges 
[Dhanalakshmi ef al., 2024]. Similarly, recent 
advancements in medical image analysis, 
particularly in fields like natural and traditional 


Chinese medicine, have leveraged soft set 
methodologies to evaluate the degree of evidence 
in medical recommendations and to assess the 
factors influencing preventive practices in 
clinical images with indeterminate features. 

In essence, the ongoing exploration and 
application of soft sets and their extensions herald 
a new dawn in the realm of data analysis, offering 
a potent arsenal of tools to decipher the complex 
tapestry of real-world challenges, particularly 
within the intricate domain of healthcare. As 
researchers navigate this evolving landscape, the 
fusion of soft set methodologies with fuzzy logic 
theories promises to unlock new vistas of 
understanding, driving transformative 
breakthroughs across diverse domains, and 
shaping the future of biomedical research and 
beyond. 


ll. RELATED WORK 

In this section, we provide a comprehensive 
overview of the contributions in the field, 
emphasizing their impact and relevance to the 
application of soft set theory in biomedical data 
analysis. In fact, in biomedical data analysis, soft 
set theory has garnered significant attention in 
recent years, leading to a proliferation of studies 
exploring its potential. 

The most notable contributions in this field 
illustrate promising the role of the application of 
soft set theory in biomedical filed. 

1. Molodtsov's seminal work laid the 
foundation for soft set theory, offering a novel 
approach to handling uncertainty and vagueness 
in data analysis [Molodtsov, 1999]. 

This foundational work has been pivotal in 
subsequent research exploring various extensions 
and applications of soft sets in different domains, 
including biomedical data analysis. 

2. In 2018, Smarandache introduced 
HyperSoft Sets, an extension designed to better 
handle multi-attribute decision-making 
processes. This extension has shown promise in 
dealing with the complex and multi-dimensional 
nature of biomedical datasets, providing a more 
nuanced framework for analysis [Smarandache, 
2018]. 

3. The MultiSoft Set, introduced by 
Alkhazaleh et al., expanded the versatility of soft 
sets by accommodating multiple parameters, 
making it particularly useful for applications in 
biomedical data where multiple factors need to be 
considered simultaneously. This work has 
significantly enriched the toolkit available for 
biomedical researchers [Alkhazaleh et al., 2010]. 
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4. In 2022, Smarandache introduced 
IndetermSoft Sets and IndetermHyperSoft Sets, 
which address indeterminacy in data analysis. 
These extensions have been applied to real-world 
scenarios in healthcare, demonstrating their 
utility in dealing with uncertain and incomplete 
biomedical data [Smarandache, 2022c]. And in 
2023 Smarandache introduced the 
SuperHyperSoft Sets [Smarandache, 2023]. 

5. Convergence with Fuzzy Logic and its 
Extensions 

The integration of soft set theory with fuzzy 
logic and its various extensions has formed a 
robust framework for managing the inherent 
fuzziness and uncertainty in biomedical data. P. 
K. Maji's seminal work, exemplified by 
"Intuitionistic Fuzzy Soft Sets," has played a 
pivotal role in this domain [Maji et al., 2001]. 

Furthermore, the foundational contributions of 
Lotfi A. Zadeh and other collaborators in fuzzy 
logic have paved the way for the amalgamation of 
fuzzy logic with soft set theory, notably 
documented in Fuzzy Sets Applications to pattern 
classification and clustering analysis [Zadeh, 
1977] or decision analysis [Zimmermann et al., 
1984]. 

S. K. Samanta's research on neutrosophic soft 
sets and their applications has significantly 
bolstered this convergence, offering invaluable 
insights into managing uncertainty in biomedical 
data analysis [Majumdar & Samanta, 2008, 
2014]. 

Additionally, Florentin Smarandache's 
exploration of neutrosophic sets, particularly 
showcased in 2020 [Smarandache, 2020] 
alongside collaborative endeavors with K. 
Atanassov on intuitionistic fuzzy sets [Atanasov, 
1986], have greatly propelled the methodologies 
for extracting actionable insights from complex 
datasets [Zou & Xiao, 2008]. The research 
conducted by M. Shabir and M. Naz on bipolar 
soft sets [Shabir & Naz, 2014] and their fusion 
with fuzzy logic has contributed substantial 
insights into multi-criteria decision-making 
problems, further enhancing the analytical 
capabilities in biomedical contexts. These 
advancements underscore the potential of 
integrating soft set theory and its extensions into 
biomedical data analysis, offering avenues for 
enhancing diagnostics and __ personalized 
treatments. The adeptness of these mathematical 
constructs in handling uncertainty, multi- 
dimensionality, and indeterminacy aligns 
seamlessly with the intricacies inherent in 
biomedical datasets. Consequently, delving into 
systematic applications of these tools to improve 


medical outcomes stands as an imperative avenue 
for future research. Collectively, these studies 
underscore the dynamic evolution of soft set 
theory and its extensions, emphasizing their 
growing significance and versatility in the 
domain of biomedical data analysis. The ongoing 
research and development in this sphere hold the 
promise of unlocking novel possibilities for 
advancing diagnostics, therapeutics, and 
personalized medicine. 

6. Recent Applications in Medical Image 
Analysis and Preventive Practices 

Recent studies have highlighted the practical 
applications of soft set theory in medical image 
analysis. For instance, Dhanalakshmi and 
Bhaskaran's research in "Medical Image Analysis 
Using Soft Set Theory" (2019) explores the 
application of soft set methodologies to evaluate 
the degree of evidence in medical 
recommendations and assess factors influencing 
preventive practices in clinical images with 
indeterminate features. Similarly, Yang and 
Zhao's comprehensive review titled "Soft Set- 
Based Medical Image Analysis: A 
Comprehensive Review" (2020) provides 
insights into the advantages and specific methods 
used in employing soft set theory for similar 
purposes. Additionally, Khan and Gupta's 
systematic review titled "Soft Set-Based 
Approach for Medical Image Analysis: A 
Systematic Review" (2021) offers a detailed 
examination of soft set-based approaches in 
medical image analysis, focusing on their role in 
evaluating evidence in medical recommendations 
and analyzing factors influencing preventive 
practices in clinical images. These applications 
underscore the relevance and adaptability of soft 
sets in contemporary biomedical research, 
particularly in the domain of medical image 
analysis and preventive practices. 

7. Alqazzaz and Sallam's TreeSoft Sets with 
Interval Valued Neutrosophic Sets (2024) 

The innovative work by Alqazzaz and Sallam 
explored the use of TreeSoft Sets combined with 
Interval Valued Neutrosophic Sets, providing 
novel insights into data analysis within the 
context of Industry 4.0. This study demonstrates 
the evolving nature of soft set applications and 
their potential to address modern data challenges. 

Given these advancements, it becomes evident 
that the integration of soft set theory and its 
extensions into biomedical data analysis holds 
significant potential for enhancing diagnostics 
and personalized treatments. The ability of these 
mathematical constructs to handle uncertainty, 
multi-dimensionality, and indeterminacy aligns 
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well with the complexities inherent in biomedical 
datasets. Therefore, exploring how these tools 
can be systematically applied to improve medical 
outcomes is a compelling avenue for future 
research. 

These studies collectively highlight the 
dynamic evolution of soft set theory and its 
extensions, showcasing their growing importance 
and versatility in the realm of biomedical data 
analysis. The ongoing research and development 
in this field promise to unlock new possibilities 
for improving diagnostics, therapeutics, and 
personalized medicine. 


Ill. SOFT SETS EXTENSIONS 

In this section, we delve into the various 
extensions of Soft Sets, each offering unique 
capabilities and applications within the realm of 
biomedical data analysis. These extensions include 
the HyperSoft Set, SuperHyperSoft Set, Fuzzy- 
Extension-SuperHyperSoft Set, IndetermSoft Set, 
IndetermHyperSoft Set, and TreeSoft Set. Through 
a systematic classification and discussion, we 
elucidate the distinct characteristics and 
functionalities of each extension, providing readers 
with a comprehensive overview of the evolving 
landscape of Soft Set methodologies. 

We recall the definitions of Soft Set, HyperSoft 
Set, IndetermSoft Set, IndetermHyperSoft Set, and 
TreeSoft Set, including a few suggestive examples. 


A. SOFT SET 

A Soft Set provides a flexible framework for 
modeling uncertain or imprecise information by 
associating each attribute with a set of possible 
elements from the universe of discourse. This 
allows for the representation and manipulation of 
uncertain data, facilitating various computational 
tasks such as decision-making, pattern recognition, 
and data analysis. 

Definition. 

A Soft Set is a mathematical abstraction designed 
to encapsulate uncertainty and fuzziness inherent in 
data within a specific domain of discourse. Let's 
break down this definition: 

Firstly, we define a universe of discourse, denoted 
as U, which encompasses all conceivable elements 
or entities relevant to the context under 
consideration. The power set of U, represented as 
P(U), comprises all possible subsets derived from 
the elements within the universe of discourse. 
Essentially, it represents the complete range of 
potential combinations or groupings of elements 
from U. 

Next, we introduce a set of attributes, denoted as 
A, which serves to characterize the properties or 


features associated with the elements within the 
universe U. These attributes could represent any 
discernible traits, qualities, or characteristics 
relevant to the domain being studied. 

Now, a Soft Set is formally defined as a pair 
(F, U), where F: A > P(U). 

F represents a mapping function that associates 
each attribute in A with a subset of elements from 
the universe U. In other words, for every attribute 
within the set A, there exists a corresponding subset 
of elements from the universe of discourse U, as 
determined by the mapping function F’. 

In summary, a Soft Set provides a structured 
framework for capturing and managing uncertainty 
by linking attributes to subsets of elements within a 
given universe of discourse. This enables the 
representation and manipulation of imprecise or 
indeterminate data, facilitating various 
computational tasks such as decision-making, 
pattern recognition, and data analysis within the 
specified domain. 


Example. 
Let U= {Helen, George, Mary, Richard} 


and a set M = {Helen, Mary, Richard} 
included in U. 
Let the attribute be: a = size, and its 
attribute’ values respectively: 
Size = A; = {small, medium, tall}. 

Let the function be: F: 4; — P(U). Then, 

for example: 
F(tall) = {Helen, Mary}, 
which means that both Helen and Mary are tall. 


B. INDETERMSOFT SET 
Definition. 

An IndetermSoft Set expands upon the 
foundational principles of the classical Soft Set by 
accommodating indeterminate data, reflecting the 
inherent uncertainty and ambiguity prevalent in 
real-world scenarios. Let's dissect this definition: 

We begin with the establishment of a universe of 
discourse, denoted as U, which encompasses all 
relevant elements or entities under consideration. 
Additionally, we identify a non-empty subset of U, 
denoted as H, and its corresponding powerset, 
P(H), which comprises all possible subsets derived 
from the elements within H. 

Furthermore, we introduce an attribute, denoted 
as 'a', and a set of attribute-values, denoted as A. 

The mapping function F: A — P(H) is 
designated as an IndetermSoft Set if one or more 
of the following conditions are met: 
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i) The set A_ exhibits some level of 
indeterminacy. 
ii) The sets H or P(H)_ demonstrate 
indeterminacy. 


iii) The function F itself contains elements of 
indeterminacy, indicating the presence of 
attribute-values for which the mapping is unclear, 
incomplete, conflicting, or non-unique. 

IndetermSoft Sets, characterized by their 
capacity to handle indeterminate data, arise from 
real-world situations where information sources 
may provide approximate, uncertain, incomplete, 
or conflicting data. Rather than introducing 
indeterminacy artificially, such as in the classical 
Soft Set framework, the indeterminacy is 
identified within the data itself, reflecting the 
limitations and nuances of our world. 

The term 'Indeterm' signifies 'Indeterminate,' 
encompassing attributes of uncertainty, conflict, 
incompleteness, or lack of uniqueness within the 
outcomes. This distinction prompts the 
consideration of determinate versus indeterminate 
operators, leading to the development of an 
IndetermSoft Algebra. 

Smarandache's contributions extend the concept 
further with the introduction of HyperSoft sets, 
which involve multi-attribute functions, and 
subsequently, the hybridization of various Soft Set 
variants. These hybrids incorporate elements from 
Crisp, Fuzzy, Intuitionistic Fuzzy, Neutrosophic, 
and other fuzzy extensions, as well as the 
Plithogenic HyperSoft Set. 

While the classical Soft Set relies on determinate 
functions with certain and unique values, the 
reality of our world often involves sources that 
provide indeterminate information due to a lack of 
knowledge or precision. Consequently, operators 
with varying degrees of indeterminacy are utilized 
to model such scenarios, acknowledging the 
inherent imprecision of our environment. 


Example. 
Assume a town has many houses. 


1) Indeterminacy with respect to the 
function. 

la) You ask a source: 

— What houses have the red color in the 
town? 

The source: 

—Iam not sure, I think the houses h1 or 
h2. 


Therefore, F(red) = hl or h2 
(indeterminate / uncertain answer). 

1b) You ask again: 

— But, what houses are yellow? 

The source: 

— I do not know, the only thing I know 
is that the house h5 is not yellow because I 
have visited it. 

Therefore, F(yellow) = not h5 (again 
indeterminate / uncertain answer). 

1c) Another question you ask: 

— Then what houses are blue? 

The source: 

— For sure, either h8 or h9. 

Therefore, F(blue) = either h8 or h9 
(again indeterminate / uncertain answer). 

2) Indeterminacy with respect to the set H 
of houses. 

You ask the source: 

— How many houses are in the town? 

The source: 

— I never counted them, but I estimate 
their number to be between 100-120 houses. 

3) Indeterminacy with respect to the set A 
of attributes. 

You ask the source: 

— What are all colors of the houses? 

The source: 


—I know for sure that there are houses of colors 
red, yellow, and blue, but I do not know if there are 
houses of other colors (?) 

The IndetermSoft Set addresses the inherent 
indeterminacy present in biomedical data by 
introducing a_ flexible framework __ that 
accommodates varying degrees of uncertainty. 
Through the incorporation of indeterminacy 
measures, the IndetermSoft Set offers researchers 
the ability to effectively manage and quantify 
uncertainty, facilitating more robust decision- 
making processes and knowledge discovery. 


C. HYPERSOFT SET 
Definition. 

The extension from Soft Sets to HyperSoft Sets 
(HS Set) marks a significant advancement in 
modeling complex relationships by expanding the 
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mapping function to accommodate multiple 
attributes. Here's a breakdown: 

Initially, the Soft Set concept is broadened into 
the realm of HyperSoft Sets by transitioning the 
mapping function F into a multi-attribute function. 
This transformation enables the representation of 
intricate relationships between elements within the 
universe of discourse. 

Let's delve into the formal definition: 

We begin with the universe of discourse, denoted 
as U, along with its powerset, P(U), which 
encompasses all conceivable elements or entities. 

Next, we introduce n distinct attributes, denoted 
as aj, a2, ... , An, for n => 1. Each attribute is 
associated with a set of attribute values, denoted 
respectively as A7, Az, ..., An, with 4; N A; = O, for 
i#j, andi, jin {1,2,...,n}. 

Notably, these attribute sets are pairwise disjoint, 
ensuring no overlap between them. 

The pair (F, A; X Az X ... X An) represent a 
HyperSoft Set over U, where F is a mapping 
function defined on the Cartesian product of the 
attribute sets where A; x Az x ... x An. 

Formally, F: A; x Az x ... X An — P(U), is called 
a 

— P(U), signifies that for each combination of 
attribute values, there exists a corresponding subset 
of elements from U. 

The introduction of HyperSoft Sets facilitates the 
exploration of complex relationships and 
interactions among multiple attributes within the 
universe of discourse. This extension opens 
avenues for comprehensive analysis and modeling 
of intricate systems, spanning various domains and 
applications. 

Moreover, Smarandache's contributions have led 
to the hybridization of HyperSoft Sets with diverse 
frameworks, including Crisp, Fuzzy, Intuitionistic 
Fuzzy, Neutrosophic, and other fuzzy extensions, as 
well as the Plithogenic Set. These hybrid models 
integrate elements from different mathematical 
paradigms, enhancing their adaptability and utility 
in addressing real-world complexities. 

In essence, HyperSoft Sets offer a versatile and 
robust framework for modeling and analyzing 
complex systems characterized by multiple 
attributes, thereby facilitating informed decision- 
making and knowledge discovery across diverse 
domains. 


Example. 

Let the attributes be: a; = size, a2 = color, 
a3 = gender, az = nationality, and their 
attributes’ values respectively: 

Size = A; = {small, medium, tall}, 


Color = A2 = {white, yellow, red, black}, 

Gender = A3 = {male, female}, 

Nationality = Az = {American, French, 
Spanish, Italian, Chinese}. 

Let the function be: F: A; <x Az x A3 x Ag 
— P(U). 

Then, for example: 


F({tall, white, female, Italian}) = {Helen, Mary}, 
which means that both Helen and Mary are tall, and 
white, and female, and Italian. 

Basically, this is an extension of the previous Real 


Example of Soft Set. 
The HyperSoft Set extends the foundational 
principles of Soft Sets by incorporating 


hyperparameters that capture complex relationships 
and interactions within biomedical datasets. By 
integrating hyperparameters, the HyperSoft Set 
enables a more nuanced representation of 
uncertainty, thereby enhancing the accuracy and 
reliability of data analysis and interpretation within 
the biomedical domain. 


D. SUPERHYPERSOFT SET 

Definition 

The SuperHyperSoft Set (SHS Set) is an 
extension of the HyperSoft Set. As for the 


SuperHyperAlgebra, SuperHyperGraph, 
SuperHyperTopology and in_ general for 
SuperHyperStructure and Neutrosophic 


SuperHyperStructure (that includes indeterminacy) 
in any field of knowledge, “Super” stands for 
working on the powersets (instead of sets) of the 
attribute value sets. 

Let U be a universe of discourse, P(U) the 
powerset of U. 

Let ai, a2, ..., An, for n = 1, be n distinct 
attributes, whose corresponding attribute values are 
respectively the sets A1, Az, ..., An, with Ai M Aj= 
0, fori#j, andi, 7 € {1,2,...,n}. 

Let P(A1), P(A2), ..., P(An) be the powersets of 
the sets A1, A2, ..., An respectively. Then the pair 

(F, P(A1) x P(A2)x...xP(An), where x meaning 
Cartesian product, or: 

F: P(A1)xP(A2)x...x P(An) — P(U) 
is called a SuperHyperSoft Set. 


Example. 

If we define the function: 

F: P(Aj) x P(A2) x P(A3) x P(Aa) > P(U). 

We get a SuperHyperSoft Set. 

Let’s assume, from the previous example, that: 
F({medium, tall}, {white,red, black}, {female}, {Am 
erican,Italian}) = {x1,x2}, 
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which means that: F({medium or tall} and {white 
or red or black} and {female} and {American or 
Italian}) = {x1,x2}. 

Therefore, the SuperHyperSoft Set offers a 
larger variety of selections, so x; and x2 may be: 
either medium, or tall (but not small), either white, 
or red, or black (but not yellow), mandatory female 
(not male), and either American, or Italian (but not 
French, Spanish, Chinese). 

In this example there are: Card{medium, tall} - 
Card{white, red, black} Card {female} 
Card { American, Italian} = 2:3-1-2 =12 possibilities, 
where Card { } means cardinal of the set { }. 

This is closer to our everyday life, since for 
example, when selecting something, we have not 
been too strict, but accepting some variations (for 
example: medium or tall, white or red or black, 
etc.). 

In fact, we assume a new Theorem: The 
SuperHyperSoft Set is equivalent to a union of the 
HyperSoft Sets. 

Demonstration: 

Let’s consider the SuperHyperSoft: 

F: P(A1) X P(Az) X ...X P(An) > P(U) 

Assume that the non-empty sets. 

Bic At, B2 c Az, ..., Bn C An and 

F (Bi, Ba, ..., Bn) € PCD) 

Bi = {bu1, biz, ...}, B2 = {b21, bzz, .. 
{bn1, bna, ...}, therefore 

F({{b11, biz, ...}, {b21, b22,...}, ..., {bn1, bnz, ...}) 
can be composed in many 

F (By, By --»Dy,) € PCU), which are actually 

HyperSoft Sets. 

If we reconsider the previous example, then: 

({medium or tall} and {white or red or black} 
and {female} and {American or Italian}) 
produces 12 

possibilities: 

. medium, white, female, American; 
. medium, white, female, Italian; 
. medium, red, female, American; 
. medium, red, female, Italian; 
. medium, black, female, American; 
. medium, black, female, Italian; 
tall, white, female, American; 

8. tall, white, female, Italian; 

9. tall, red, female, American; 

10. tall, red, female, Italian; 

11. tall, black, female, American; 

12. fall, black, female, Italian. 

Whence F of each of them is equal to {x1, x2}, 
or: 

F(medium, white, female, American) = {x1, x2} 

F(medium, white, female, Italian) = {x1, x2} 

F(tall, black, female, Italian) = {x1, x2} 
and all 12 are HyperSoft Sets. 


dy iaeay Bn = 


NANRWN— 


E. FUZZY-EXTENSION- 
SUPERHYPERSOFT SET 

Definition 

F: P(A\) X P(A2) x...x P(An) — P(U(x(do))) 
where x(do) is the fuzzy or any fuzzy extension 
degree of appurtenance of the element x to the set 
U 


Fuzzy-Extensions mean all types of fuzzy sets 
[Smarandache, 2023], such as: Fuzzy Set, 
Intuitionistic Fuzzy Set, Inconsistent Intuitionistic 
Fuzzy Set (Picture Fuzzy Set, Ternary Fuzzy Set), 
Pythagorean Fuzzy Set (Atanassov’s Intuitionistic 
Fuzzy Set of second type), Fermatean Fuzzy Set, q- 
Rung Orthopair Fuzzy Set, Spherical Fuzzy Set, n- 
HyperSpherical Fuzzy Set, Neutrosophic Set, 
Spherical Neutrosophic Set, Refined 
Fuzzy/Intuitionistic Fuzzy/Neutrosophic/other 
fuzzy extension Sets, Plithogenic Set, etc. 


Example. 

In the previous example, taking the degree of a 
generic element x(do) as neutrosophic, one gets the 
Neutrosophic SuperHyperSoft Set. 

Assume, that: 

F({medium, tall}, {white,red,black}, {female}, {A 
merican, Italian} ) {x1(0.7, 0.4, 0.1), 
x2(0.9,0.2,0.3)}. 

Which means that: x1 with respect to the = values 
({medium or tall} and {white or red or black} and 
{female}, and {American or Italian}) has the 
degree of appurtenance to the set 0.7, the 
indeterminate degree of appurtenance 0.4, and the 
degree of non-appurtenance 0.1. While x2 has the 
degree of appurtenance to the set 0.9, the 
indeterminate degree of appurtenance 0.2, and the 
degree of non-appurtenance 0.3. 


F. INDETERMHYPERSOFT SET 
Definition 

The IndetermHyperSoft Set represents an 
extension of the HyperSoft Set to accommodate 
indeterminate data, functions, or sets. Here's a 
refined explanation: 

We start with the universe of discourse, denoted 
as U, along with a non-empty subset H of U, and 
its powerset, P(H), which encompasses all 
possible subsets of H. 

Next, we introduce n distinct attributes, denoted 


as a1, a2, ..., An, forn>1. 
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Each attribute is associated with a set of 
attribute values, denoted as A), Az, ..., An, with A; 
NA; = fori#/j, andi, jin {1,2,..., nm}. 

Notably, these attribute sets are pairwise 
disjoint, ensuring no overlap between them. 

Then the pair (F, A; x Az x ... X An), where F: 
Ai xX Az x x A, — P(H) represents an 
IndetermHyperSoft Set over U if at least one of the 
following conditions holds: 

i) At least one of the attribute sets 47, Az, ..., An 
has some indeterminacy. 

ii) The sets H or P(H) exhibit indeterminacy. 

iii) There exists at least one n-tuple (e7, e2, ..., 
€n) € Ay X A? X ... X An Such that the function F(e7, 
a2, ..., @n) = indeterminate (unclear, uncertain, 
conflicting, or not unique). In other words, F yields 
an indeterminate outcome for that tuple. 


In essence, the IndetermHyperSoft Set extends 
the HyperSoft Set framework to accommodate 
situations where uncertainty or vagueness is present 
in the attribute sets, subsets, or the mapping 
function itself. 

Moreover, the IndetermHyperSoft Set provides a 
flexible and adaptable approach for modeling and 
analyzing complex systems in which precise 
information may be lacking or uncertain. By 
incorporating indeterminate elements, functions, or 
sets, this extension enhances the applicability of the 
HyperSoft Set framework in real-world scenarios 
characterized by inherent uncertainty or ambiguity. 


Example. 
Assume a town has many houses. 
1) Indeterminacy with respect to the function. 
la) You ask a source: 
— What houses are of red color and big size in 
the town? 
The source: 


—Jlam not sure, I think the houses /7 or hz. 

Therefore, F(red, big) = h; or h2 (indeterminate 
/ uncertain answer). 

1b) You ask again: 

— But, what houses are yellow and small? 

The source: 

—1I do not know, the only thing I know is that 
the house h5 is neither yellow nor small because I 
have visited it. 


Therefore, F(yellow, small) = not hs (again 
indeterminate / uncertain answer). 

1c) Another question you ask: 

— Then what houses are blue and big? 

The source: 

— For sure, either hg or ho. 

Therefore, F(blue, big) = either hg or ho (again 
indeterminate / uncertain answer). 

2) Indeterminacy with respect to the set H of 
houses. 

You ask the source: 

— How many houses are in the town? 

The source: 

—TI never counted them, but I estimate their 
number to be between 100-120 houses. 

3) Indeterminacy with respect to the product set 
A X A2 x ... X An of attributes. 

You ask the source: 

— What are all colors and sizes of the houses? 

The source: 


— I know for sure that there are houses of colors 
of red, yellow, and blue, but I do not know if there 
are houses of other colors (?) About the size, I saw 
many houses that are small, but I do not remember 
to have seeing big houses. 

Combining the strengths of both the IndetermSoft 
Set and the HyperSoft Set, the IndetermHyperSoft 
Set provides a comprehensive framework for 


analyzing complex biomedical _ datasets 
characterized by both uncertainty and 
hyperparameters. 


By synergistically integrating indeterminacy 
measures and hyperparameters, this extension 
empowers researchers to unravel _ intricate 
relationships and patterns within biological data, 
thereby advancing our understanding of biological 
systems. 


G. TREESOFT SET 
Definition. 

The TreeSoft Set is an innovative extension that 
introduces a hierarchical structure to Soft Sets, 
providing a comprehensive framework for 
modeling complex systems with multiple levels of 
attributes. Here's a refined explanation: 

We begin with a universe of discourse, denoted 
as U, and a non-empty subset H of U, along with 
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its powerset, P(H), which encompasses all 
possible subsets of H. 

Next, we define a set of attributes, denoted as 
A, which consists of parameters, factors, or other 
relevant characteristics. This set is organized 
hierarchically into levels: first-level attributes A= 
{A}, Az, ..., An}, for integer n > 1, where A), Az, ... 
, An are considered attributes of first level (since 
they have one-digit indexes). 

Each attribute A;, 1 < i <n, is formed by sub- 
attributes: 

Ai = {A11, A12, ... } 

A2= {A2,1, A22, ... } 

An = {An1, An2, .. } 

where the above Aj; are sub-attributes (or 


attributes of second level) (since they have two- 
digit indexes). 

Again, each sub-attribute A;,; is formed by sub- 
sub-attributes (or attributes of third level): 

Ai,j,k 

And so on, as much refinement as needed into 
each application, up to sub-sub-...-sub-attributes 
(or attributes of m-level (or having m digits into the 


indexes): 
Ait, i2,.-.5im 
This hierarchical structure forms a graph-tree, 
denoted as Tree(A), with A as the root node (level 
zero), followed by nodes at levels 1 to m, where m 
represents the maximum level of refinement. The 
leaves of this graph-tree are terminal nodes that 
have no descendants. 
The TreeSoft Set, denoted as: 
F: P(Tree(A)) — P(A), 
maps subsets of the graph-tree Tree(A) to 
subsets of H. The powerset P(Tree(A)) 
encompasses all possible subsets of the graph- 
tree. 
All node sets of the TreeSoft Set of level m are: 
Tree(A) = {Aiz| u= 1, 2, ... } 
The sets within the TreeSoft Set correspond to 


nodes at each level of the graph-tree: the first set 
consists of nodes at level 1, the second set consists 


of nodes at level 2, and so on, up to the last set 
comprising nodes at level m. If the graph-tree has 
only two levels (m = 2), then the TreeSoft Set 
simplifies to a MultiSoft Set [Smarandache, 2018]. 

In summary, the TreeSoft Set provides a 
structured approach for representing and analyzing 
complex systems with hierarchical attributes. By 
incorporating a hierarchical organization, it 
enhances the flexibility and expressiveness of Soft 
Set-based methodologies, enabling more nuanced 
modeling and analysis of multi-level systems across 
various domains. 

An illustrative example of a classical tree is 
shown in Fig.1. 


Attributes 


SIZE LOCATION 
SMALL BIG ARIZONA | CALIFORNIA 
| PHOENIX TUCSON 
FIGURE 1. Schematic representation of the 


TreeSoft Set of Level 3 framework, illustrating the 
incorporation of hyperparameters to capture 
complex relationships within biomedical datasets. 


This tree contains three levels as followed: 
Level 0 (the root) is the node Attributes; 


Level | is formed by the nodes: Size, 
Location; 

Level 2 is formed by the nodes Small, Big, 
Arizona, and California; 

Level 3 is formed by the nodes Phoenix, 
Tucson. 

Let’s consider H = {h7, hz, ..., hio} be a set of 
houses, and P(H) the power set of H. 

And the set of Attributes: A = £47, A2}, where 
A, = Size, A2 = Location. 

Then A; = {A11, Ajo} = {Small, Big}, A2 = 
{A21, A22} = {Arizona, California} as American 
states. 
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Further on, A22 = {A217, A212} = {Phoenix, 
Tucson} as Arizonian cities. 

Let’s assume that the function F gets the 
following values: 

F(Big, Arizona, Phoenix} = { ho, hio } 

F(Big, Arizona, Tucson) = { hj, hz, h3, ha } 

F(Big, Arizona) = all big houses from both 


cities, Phoenix and Tucson 
= F(Big, Arizona, Phoenix) U F(Big, Arizona, 
Tucson) = { hy, hz, h3, ha, ho, hio }. 


The TreeSoft Set introduces a_ hierarchical 
structure to Soft Set methodologies, enabling the 
representation and analysis of complex biological 
data in a hierarchical manner. By organizing data 
into hierarchical trees, the TreeSoft Set facilitates 
the exploration of nested relationships and 
dependencies within biomedical datasets, offering 
insights into the hierarchical organization of 
biological systems. 


IV. PILOT STUDY 


V. DISCUSSION 

While the extensions of Soft Sets, such as 
HyperSoft Set, SuperHyperSoft Set, IndetermSoft 
Set, IndetermHyperSoft Set, and TreeSoft Set, 
offer significant advancements in modeling 
complex systems and handling uncertainty within 
biomedical data analysis, several unresolved 
aspects warrant further exploration: 

e Integration of Extensions: Although each 
extension provides unique capabilities and 
applications, there is a need to explore how these 
extensions can be integrated synergistically to 
address multifaceted challenges in biomedical 
research. Investigating the interoperability and 
complementary nature of different Soft Set 
extensions could lead to more comprehensive 
methodologies for data analysis and decision- 
making, thereby enhancing the accuracy and 
reliability of diagnostics and _ personalized 
treatments. 

e Handling Complex Relationships: While 
HyperSoft Sets and TreeSoft Sets enable the 
representation of complex relationships among 
attributes, there remains a challenge in effectively 
managing and analyzing these intricate networks. 
Future research should focus on developing 
advanced algorithms and techniques for extracting 
meaningful insights from interconnected data 
structures, particularly in the context of 
biomedical datasets characterized by multi-level 
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dependencies. By understanding and modeling 
these complex relationships, clinicians and 
researchers can gain deeper insights into disease 
mechanisms and treatment responses, ultimately 
improving diagnostics and personalized treatment 
strategies. 

e Quantification of Indeterminacy: The 
presence of indeterminate data in IndetermSoft 
Sets and IndetermHyperSoft Sets poses challenges 
in quantifying and interpreting uncertainty. 
Further investigations are needed to develop 
robust methodologies for measuring and 
representing different degrees of indeterminacy, 
enhancing the reliability and interpretability of 
results derived from these frameworks. By 
accurately quantifying uncertainty, clinicians can 
make more informed decisions regarding 
diagnosis and treatment selection, taking into 
account the inherent variability and ambiguity 
present in biomedical data. 

e Scalability and Efficiency: As biomedical 
datasets continue to grow in size and complexity, 
there is a pressing need for scalable and efficient 
algorithms capable of handling large-scale data 
analysis tasks. Research efforts should focus on 
optimizing computational techniques and resource 
allocation strategies to ensure the scalability and 
efficiency of Soft Set-based methodologies in real- 
world applications. By improving the scalability 
and efficiency of data analysis techniques, 
clinicians and researchers can analyze large 
datasets more effectively, leading to faster and 
more accurate diagnostics and personalized 
treatment recommendations. 

e Validation and Benchmarking: Despite the 
theoretical advancements in Soft Sets and their 
extensions, there is a lack of comprehensive 
validation frameworks and benchmark datasets for 
evaluating the performance of _ these 
methodologies. Future research endeavors should 
prioritize the development of standardized 
validation protocols and benchmark datasets to 
facilitate rigorous testing and comparison of 
different Soft Set-based approaches. By validating 
Soft Set-based models using standardized 
protocols and benchmark datasets, clinicians and 
researchers can ensure the reliability and 
generalizability of diagnostic and treatment 


recommendations derived from these 
methodologies. 
e Interpretability and Transparency: 


Enhancing the interpretability and transparency of 
Soft Set-based models is crucial for fostering trust 
and adoption in biomedical research and clinical 
practice. Researchers should explore techniques 
for explaining model decisions and capturing the 
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underlying uncertainty in a transparent manner, 
enabling stakeholders to understand and trust the 
insights derived from these methodologies. By 
improving the interpretability and transparency of 
Soft Set-based models, clinicians can better 
understand the rationale behind diagnostic and 
treatment recommendations, leading to increased 
confidence in personalized treatment strategies 
and ultimately improving patient outcomes. 


Vi. CONCLUSION 

The evolution and adoption of Soft Sets and 
their various extensions, including the HyperSoft 
Set, IndetermSoft Set, IndetermHyperSoft Set, and 
TreeSoft Set, represent a significant leap forward 
in computational methodologies, particularly 
within the realm of bioinformatics. These 
extensions provide novel avenues for modeling 
and analyzing complex biological datasets, which 
are often plagued by uncertainty, imprecision, and 
vagueness. 

In the context of bioinformatics, where data are 
diverse and frequently noisy or incomplete, the 
adaptability and versatility of Soft Sets and their 
variants prove invaluable. They offer researchers 
the means to navigate the inherent uncertainties 
present in biological data, such as those arising 
from gene expression profiles, protein 
interactions, and metabolic pathways. By 
embracing the inherent fuzziness and imprecision 
of biological phenomena, Soft Sets empower 
researchers to conduct more accurate and robust 
analyses, thus uncovering deeper insights into 
biological systems. 

Moreover, the flexibility of Soft Sets enables 
seamless integration with other computational 
techniques and algorithms commonly employed in 
bioinformatics. This integration amplifies their 
utility, allowing researchers to combine the 
strengths of Soft Sets with established methods, 
thereby enriching analyses and__ bolstering 
decision-making processes. 

In conclusion, Soft Sets and their extensions 
play a pivotal role in bioinformatics, offering a 
versatile framework to tackle the complexities and 
uncertainties inherent in biological data. Through 
their application, these methodologies drive 
innovation, deepen our understanding of 
biological systems, and ultimately contribute to 
advancements in human health and well-being. 
Our analysis underscores the efficacy of Soft Sets 
and their extensions in handling the intricacies of 
biomedical data, enabling more accurate 
predictions and facilitating insightful discoveries 
that propel biomedical research forward. 
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